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SEQUENCE LISTING 



) GENERAL INFORMATION: 

(i) APPLICANTS: Morrow, Casey D. and Porter, Donna, C. 

(ii) TITLE OF INVENTION: ENCAPS I DATED RECOMBINANT POLIOVIRUS 

NUCLEIC ACID AND METHODS OF MAKING AND 
USING SAME 

(iii) NUMBER OF SEQUENCES: 23 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 STATE STREET, SUITE 510 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: ASCII 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 000000 

(B) FILING DATE: 15-FEB-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/087,009 

(B) FILING DATE: 01-JUL-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGE NT INFORMATION: 

(A) NAME: Silveri, Jean M. 

(B) REGISTRATION NUMBER: P-39,030 

(C) REFERENCE /DOCKET NUMBER: UAG-004CP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TATTAGTAGA TCTG 14 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
T AC AG ATG T A CTAA 14 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 20 . . 845 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AC AC AG C AAT CAGGTCAGC CAA AAT TAC CCT ATA GTG CAG AAC ATC CAG GGG 

Gin Asn Tyr Pro lie Val Gin Asn lie Gin Gly 
15 10 

CAA ATG GTA CAT CAG GCC ATA TCA CCT AGA ACT TTA AAT GCA TGG GTA 
Gin Met Val His Gin Ala lie Ser Pro Arg Thr Leu Asn Ala Trp Val 
15 20 25 

AAA GTA GTA GAA GAG AAG GCT TTC AGC CCA GAA GTG ATA CCC ATG TTT 
Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val lie Pro Met Phe 
30 35 40 

TCA GCA TTA TCA GAA GGA GCC ACC CCA CAA GAT TTA AAC ACC ATG CTA 
Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu 
45 50 55 
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AAC ACA GTG GGG GGA CAT CAA GCA GCC ATG CAA ATG TTA AAA GAG ACC 244 
Asn Thr Val Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr 
60 65 70 75 

5 ATC AAT GAG GAA GCT GCA GAA TGG GAT AGA GTG CAT CCA GTG CAT GCA 2 92 

lie Asn Glu Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala 
80 85 90 

GGG CCT ATT GCA CCA GGC CAG ATG AGA GAA CCA AGG GGA AGT GAC ATA 34 0 

10 Gly Pro lie Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp lie 
95 100 105 

GCA GGA ACT ACT AGT ACC CTT CAG GAA CAA ATA GGA TGG ATG ACA AAT 388 
Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin lie Gly Trp Met Thr Asn 
15 110 115 120 

AAT CCA CCT ATC CCA GTA GGA GAA ATT TAT AAA AGA TGG ATA ATC CTG 436 

Asn Pro Pro lie Pro Val Gly Glu lie Tyr Lys Arg Trp lie lie Leu 
125 130 135 

20 

GGA TTA AAT AAA ATA GTA AGA ATG TAT AGC CCT ACC AGC ATT CTG GAC 4 84 

Gly Leu Asn Lys lie Val Arg Met Tyr Ser Pro Thr Ser lie Leu Asp 

i!3 140 145 150 155 

%j 25 ATA AGA CAA GGA CCA AAG GAA CCC TTT AGA GAC TAT GTA GAC CGG TTC 53 2 

i n He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe 

m 160 165 170 

in 

j'^j TAT AAA ACT CTA AGA GCC GAG CAA GCT TCA CAG GAG GTA AAA AAT TGG 580 

30 Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp 
175 180 185 

Q ATG ACA GAA ACC TTG TTG GTC CAA AAT GCG AAC CCA GAT TGT AAG ACT 628 

Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr 
Q 35 190 195 200 

:'s"l 

;^ ATT TTA AAA GCA TTG GGA CCA GCG GCT ACA CTA GAA GAA ATG ATG ACA 6 76 

J'T H e L eu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr 

r "~ 205 210 215 

40 

GCA TGT CAG GGA GTA GGA GGA CCC GGC CAT AAG GCA AGA GTT TTG GCT 72 4 

Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala 
220 225 230 235 

45 GAA GCA ATG AGC CAA GTA ACA AAT TCA GCT ACC ATA ATG ATG CAG AGA 77 2 

Glu Ala Met Ser Gin Val Thr Asn Ser Ala Thr lie Met Met Gin Arg 
240 245 250 

GGC AAT TTT AGG AAC CAA AGA AAG ATT GTT AAG TGT TTC AAT TGT GGC 82 0 

50 Gly Asn Phe Arg Asn Gin Arg Lys lie Val Lys Cys Phe Asn Cys Gly 
255 260 265 

AAA GAA GGG CAC ACA GCC AGA AAG T 84 6 

Lys Glu Gly His Thr Ala Arg Lys 
55 270 275 
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<2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Gin Asn Tyr Pro 
1 

Ala lie Ser Pro 
20 

Lys Ala Phe Ser 
35 

Gly Ala Thr Pro 
50 

His Gin Ala Ala 
65 

Ala Glu Trp Asp 



Gly Gin Met Arg 
100 

Thr Leu Gin Glu 
115 

Val Gly Glu lie 
130 

Val Arg Met Tyr 
145 

Lys Glu Pro Phe 



Ala Glu Gin Ala 
180 



lie Val Gin Asn 
5 

Arg Thr Leu Asn 



Pro Glu Val lie 
40 

Gin Asp Leu Asn 
55 

Met Gin Met Leu 
70 

Arg Val His Pro 
85 

Glu Pro Arg Gly 



Gin lie Gly Trp 
120 

Tyr Lys Arg Trp 
135 

Ser Pro Thr Ser 
150 

Arg Asp Tyr Val 
165 

Ser Gin Glu Val 



lie Gin Gly Gin 
10 

Ala Trp Val Lys 
25 

Pro Met Phe Ser 



Thr Met Leu Asn 
60 

Lys Glu Thr lie 
75 

Val His Ala Gly 
90 

Ser Asp lie Ala 
105 

Met Thr Asn Asn 



lie lie Leu Gly 
140 

lie Leu Asp lie 
155 

Asp Arg Phe Tyr 
170 

Lys Asn Trp Met 
185 



Met Val His Gin 
15 

Val Val Glu Glu 
30 

Ala Leu Ser Glu 
45 

Thr Val Gly Gly 



Asn Glu Glu Ala 
80 

Pro lie Ala Pro 
95 

Gly Thr Thr Ser 
110 

Pro Pro lie Pro 
125 

Leu Asn Lys lie 



Arg Gin Gly Pro 
160 

Lys Thr Leu Arg 
175 

Thr Glu Thr Leu 
190 



Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Lys Ala Leu 

195 200 205 

Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val 
210 215 220 



Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin 
225 230 235 240 

Val Thr Asn Ser Ala Thr lie Met Met Gin Arg Gly Asn Phe Arg Asn 
245 250 255 
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Gin Arg Lys lie Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Thr 
260 265 270 

5 Ala Arg Lys 
275 

(2) INFORMATION FOR SEQ ID NO : 5 : 

10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 94 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
20 (A) NAME/KEY: CDS 

(B) LOCATION: 4 . . 946 



• 3 t 



! ri 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



;4 25 

UJ AAC CAA TGG CCA TTG ACA GAA GAA AAA ATA AAA GCA TTA GTA GAA ATT 4 8 

Gin Trp Pro Leu Thr Glu Glu Lys lie Lys Ala Leu Val Glu lie 



0 1 5 10 15 



30 TGT ACA GAG ATG GAA AAG GAA GGG AAA ATT TCA AAA ATT GGG CCT GAA 9 6 

Cys Thr Glu Met Glu Lys Glu Gly Lys lie Ser Lys lie Gly Pro Glu 
20 25 30 



AAT CCA TAC AAT ACT CCA GTA TTT GCC ATA AAG AAA AAA GAC AGT ACT 144 
UJ35 Asn Pro Tyr Asn Thr Pro Val Phe Ala lie Lys Lys Lys Asp Ser Thr 
CO 35 40 45 



AAA TGG AGA AAA TTA GTA GAT TTC AGA GAA CTT AAT AAG AGA ACT CAA 192 
Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin 
40 50 55 60 

GAC TTC TGG GAA GTT CAA TTA GGA ATA CCA CAT CCC GCA GGG TTA AAA 24 0 

Asp Phe Trp Glu Val Gin Leu Gly He Pro His Pro Ala Gly Leu Lys 
65 70 75 

45 

AAG AAA AAA TCA GTA ACA GTA CTG GAT GTG GGT GAT GCA TAT TTT TCA 288 
Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 
80 85 90 95 

50 GTT CCC TTA GAT GAA GAC TTC AGG AAG TAT ACT GCA TTT ACC ATA CCT 3 36 

Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro 
100 105 110 

AGT ATA AAC AAT GAG ACA CCA GGG ATT AGA TAT CAG TAC AAT GTG CTT 384 
55 Ser He Asn Asn Glu Thr Pro Gly He Arg Tyr Gin Tyr Asn Val Leu 
115 120 125 
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Q 



50 



a 



CCA CAG GGA TGG AAA GGA TCA CCA GCA ATA TTC CAA AGT AGC ATG ACA 432 

Pro Gin Gly Trp Lys Gly Ser Pro Ala lie Phe Gin Ser Ser Met Thr 

130 135 140 

5 AAA ATC TTA GAG CCT TTT AGA AAA CAA AAT CCA GAC ATA GTT ATC TAT 48 0 

Lys He Leu Glu Pro Phe Arg Lys Gin Asn Pro Asp He Val He Tyr 
145 150 155 

CAA TAC ATG GAT GAT TTG TAT GTA GGA TCT GAC TTA GAA ATA GGG CAG 52 8 

10 Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu He Gly Gin 
160 165 170 175 

CAT AGA ACA AAA ATA GAG GAG CTG AGA CAA CAT CTG TTG AGG TGG GGA 57 6 

His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
15 180 185 190 

CTT ACC ACA CCA GAC AAA AAA CAT CAG AAA GAA CCT CCA TTC CTT TGG 624 

Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
195 200 205 



ATG GGT TAT GAA CTC CAT CCT GAT AAA TGG ACA GTA CAG CCT ATA GTG 672 
Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gin Pro He Val 
210 215 220 



; 2|j CTG CCA GAA AAA GAC AGC TGG ACT GTC AAT GAC ATA CAG AAG TTA GTG 72 0 

Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp He Gin Lys Leu Val 
225 230 235 

l 5 - 

ffi GGG AAA TTG AAT TGG GCA AGT CAG ATT TAC CCA GGG ATT AAA GTA AGG 76 8 

3t) Gly Lys Leu Asn Trp Ala Ser Gin He Tyr Pro Gly He Lys Val Arg 
it 240 245 250 255 

• _J 

l! CAA TTA TGT AAA CTC CTT AGA GGA ACC AAA GCA CTA ACA GAA GTA ATA 816 

Gin Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val He 

3% 260 265 270 

W CCA CTA ACA GAA GAA GCA GAG CTA GAA CTG GCA GAA AAC AGA GAG ATT 864 

Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu He 
275 280 285 

40 

CTA AAA GAA CCA GTA CAT GGA GTG TAT TAT GAC CCA TCA AAA GAC TTA 912 

Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
290 295 300 

45 ATA GCA GAA ATA CAG AAG CAG GGG CAA GGC CTCGAG 94 8 

He Ala Glu He Gin Lys Gin Gly Gin Gly 
305 310 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 
55 (B) TYPE : amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Gin Trp Pro Leu Thr Glu Glu Lys lie Lys Ala Leu Val Glu lie Cys 
15 10 15 

Thr Glu Met Glu Lys Glu Gly Lys lie Ser Lys lie Gly Pro Glu Asn 
20 25 ?0 

Pro Tyr Asn Thr Pro Val Phe Ala lie Lys Lys Lys Asp Ser Thr Lys 
35 40 45 

Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin Asp 
50 55 60 

Phe Trp Glu Val Gin Leu Gly lie Pro His Pro Ala Gly Leu Lys Lys 
65 70 75 80 

Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val 
85 90 95 

Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr lie Pro Ser 
100 105 110 

lie Asn Asn Glu Thr Pro Gly lie Arg Tyr Gin Tyr Asn Val Leu Pro 
115 120 125 

Gin Gly Trp Lys Gly Ser Pro Ala lie Phe Gin Ser Ser Met Thr Lys 
130 135 140 

lie Leu Glu Pro Phe Arg Lys Gin Asn Pro Asp lie Val lie Tyr Gin 
145 150 155 160 

Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu lie Gly Gin His 
165 170 175 

Arg Thr Lys lie Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly Leu 
180 185 190 

Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp Met 
195 200 205 

Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gin Pro lie Val Leu 
210 215 220 

Pro Glu Lys Asp Ser Trp Thr Val Asn Asp lie Gin Lys Leu Val Gly 
225 230 235 240 

Lys Leu Asn Trp Ala Ser Gin lie Tyr Pro Gly He Lys Val Arg Gin 
245 250 255 

Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val He Pro 
260 265 270 

Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu He Leu 
275 280 285 
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Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu lie 
290 295 300 

5 Ala Glu lie Gin Lys Gin Gly Gin Gly Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO : 7 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
20 (A) NAME /KEY : CDS 

(B) LOCATION: 7.. 1565 

Q 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ci 25 

m 



I f| GGGGCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 4 8 

Cys Pro Lys Val Ser Phe Glu Pro lie Pro lie His Tyr Cys 
15 10 



30 GCC CCG GCT GGT TTT GCG ATT CTA AAA TGT AAT AAT AAG ACG TTC AAT 
Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn 
15 20 25 30 



96 



GGA AC A GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA 144 
35 Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly 

35 40 45 

ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA 192 
He Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala 
40 50 55 60 

GAA GAA GAG GTA GTA ATT AGA TCT GTC AAT TTC ACG GAC AAT GCT AAA 240 

Glu Glu Glu Val Val He Arg Ser Val Asn Phe Thr Asp Asn Ala Lys 

65 70 75 

45 

ACC ATA ATA GTA w C AG CTG AAC ACA TCT GTA GAA ATT AAT TGT ACA AGA 288 

Thr He He Val Gin Leu Asn Thr Ser Val Glu lie Asn Cys Thr Arg 

80 85 90 

50 CCC AAC AAC AAT ACA AGA AAA AGA ATC CGT ATC CAG AGA GGA CCA GGG 3 36 

Pro Asn Asn Asn Thr Arg Lys Arg He Arg He Gin Arg Gly Pro Gly 
95 100 105 110 

AGA GCA TTT GTT ACA ATA GGA AAA ATA GGA AAT ATG AGA CAA GCA CAT 3 84 

55 Arg Ala Phe Val Thr He Gly Lys He Gly Asn Met Arg Gin Ala His 

115 120 125 
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TGT AAC ATT AGT AGA GCA AAA TGG AAT AAC ACT TTA AAA CAG ATA GAT 
Cys Asn lie Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gin lie Asp 
130 135 140 

AGC AAA TTA AGA GAA CAA TTC GGA AAT AAT AAA ACA ATA ATC TTT AAG 
Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr lie lie Phe Lys 
145 150 155 

CAA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG CAC AGT TTT AAT TGT 
Gin Ser Ser Gly Gly Asp Pro Glu He Val Thr His Ser Phe Asn Cys 
160 165 170 

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT AAT AGT ACT 
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr 
175 180 185 190 

TGG TTT AAT AGT ACT TGG AGT ACT GAA GGG TCA AAT AAC ACT GAA GGA 
Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly 
195 200 205 

AGT GAC ACA ATC ACC CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC ATG 
Ser Asp Thr He Thr Leu Pro Cys Arg He Lys Gin He lie Asn Met 
210 215 220 

TGG CAG AAA GTA GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA 
Trp Gin Lys Val Gly Lys Ala Met Tyr Ala Pro Pro He Ser Gly Gin 
225 230 235 

ATT AGA TGT TCA TCA AAT ATT' ACA GGG CTG CTA TTA ACA AGA GAT GGT 
He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly 
240 245 250 

GGT AAT AGC AAC AAT GAG TCC GAG ATC TTC AGA CTT GGA GGA GGA GAT 
Gly Asn Ser Asn Asn Glu Ser Glu He Phe Arg Leu Gly Gly Gly Asp 
255 260 265 270 

ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA 
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys 
275 280 285 

ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG 
He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val 
290 295 300 

CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT TTG TTC CTT GGG TTC 
Gin Arg Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe Leu Gly Phe 
305 310 315 

TTG GGA GCA GCA GGA AGC ACT ATG GGC GCA GCC TCA ATG ACG CTG ACG 
Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr 
320 325 330 

GTA CAG GCC AGA CAA TTA TTG TCT GGT ATA GTG CAG CAG CAG AAC AAT 
Val Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn 
335 340 345 350 
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TTG CTG AGG GCT ATT GAG GCG CAA CAG CAT CTG TTG CAA CTC ACA GTC 1104 

Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val 

355 360 365 

5 

TGG GGC ATC AAG CAG CTC CAA GCA AG A ATC CTA GCT GTG GAA AGA TAC 1152 

Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu Arg Tyr 
370 375 380 

10 CTA AAG GAT CAA CAG CTC CTA GGG ATT TGG GGT TGC TCT GGA AAA CTC 12 00 

Leu Lys Asp Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu 
385 390 395 

ATT TGC ACC ACT GCT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA TCT 124 8 

15 He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser 
400 405 410 

CTG GAA CAG ATC TGG AAT CAC ACG ACC TGG ATG GAG TGG GAC AGA GAA 12 96 

Leu Glu Gin He Trp Asn His Thr Thr Trp Met Glu Trp Asp Arg Glu 
20 415 420 425 430 

ATT AAC AAT TAC ACA AGC TTA ATA CAC TCC TTA ATT GAA GAA TCG CAA 13 44 

r*\ He Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin 

435 440 445 

=;H 25 

t 2 AAC CAG CAA GAA AAG AAT GAA CAA GAA TTA TTG GAA TTA GAT AAA TGG 13 92 

L ;L? Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp 

450 455 460 

Ln 

VH 30 GCA AGT TTG TGG AAT TGG TTT AAC ATA ACA AAT TGG CTG TGG TAT ATA 1440 
|^ Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He 

., 465 470 475 

lT AAA TTA TTC ATA ATG ATA GTA GGA GGC TTG GTA GGT TTA AGA ATA GTT 14 88 

^ 35 Lys Leu Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val 
480 485 490 

h3 TTT GCT GTA CTT TCT ATA GTG AAT AGA GTT AGG CAG GGA TAT TCA CCA 15 36 

M Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro 

40 495 500 505 510 

TTA TCG TTT CAG ACC CAC CTC CCA ATC TCG AG 1568 
Leu Ser Phe Gin Thr His Leu Pro He 
515 

45 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 519 amino acids 

(B) TYPE: amino acid. 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 ; 



Cys Pro Lys Val Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro 
5 1 5 10 . is 

Ala Gly Phe Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr 
20 25 30 

10 Gly Pro Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg 
35 40 45 



15 



Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu 
50 55 60 

Glu Val Val He Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr He 

65 7 0 75 80 



He Val Gin Leu Asn Thr Ser Val Glu He Asn Cys Thr Arg Pro Asn 
20 85 90 95 

^ Asn Asn Thr Arg Lys Arg He Arg He Gin Arg Gly Pro Gly Arg Ala 

[% 100 105 110 

^ 25 Phe Val Thr He Gly Lys He Gly Asn Met Arg Gin Ala His Cys Asn 
! ;H H5 120 125 



•J I 



He Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gin He Asp Ser Lys 
13° 135 140 

Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr He He Phe Lys Gin Ser 
145 150 155 160 

Ser Gly Gly Asp Pro Glu He Val Thr His Ser Phe Asn Cys Gly Gly 
1^5 170 175 

Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Phe 
180 185 190 

40 Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp 
I 95 200 205 



30 



35 



Thr He Thr Leu Pro Cys Arg He Lys Gin He He Asn Met Trp Gin 
210 215 220 

Lys Val Gly Lys Ala Met Tyr Ala Pro Pro He Ser Gly Gin He Arg 
225 230 235 240 

Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn 
245 250 255 

Ser Asn Asn Glu Ser Glu He Phe Arg Leu Gly Gly Gly Asp Met Arg 
260 265 270 

55 Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He Glu 
275 280 285 



45 



50 
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Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 
290 295 300 

Glu Lys Arg Ala Val Gly lie Gly Ala Leu Phe Leu Gly Phe Leu Gly 
305 310 315 320 

Ala Ala Gly Ser Thr Met. Gly Ala Ala Ser Met Thr Leu Thr Val Gin 
325 330 335 

Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu 
340 345 350 

Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly 
355 360 365 

He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys 
370 375 380 

Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
385 390 395 400 

Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu 
405 410 415 

Gin He Trp Asn His Thr Thr Trp Met Glu Trp Asp Arg Glu He Asn 
420 425 430 

Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin 
435 440 445 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser 
450 455 460 

Leu Trp Asp Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys Leu 
465 470 475 480 

Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val Phe Ala 
x 485 490 495 

Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser 
500 505 510 

Phe Gin Thr His Leu Pro He 
515 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
CACCCCTCTC CTACGTAACC AAGGATC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) T:PE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GTACTGGTCA CCATATTGGT CAAC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGAGAGAGAT GGG AG CTCG A GCGTC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
GCCCCCCTAT ACGTATTGTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CCAGTGAATT CCTAATACGA CTCACTATAG GTTAAAACAG C 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic- acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CTCTATCCTG AGCTCCATAT GTGTCGAGCA GTTTTTGGTT TAGCATTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Thr Lys Asp Leu Thr Thr Tyr Gly 
1 5 

(2; INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2203 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

CGA CCA GCA GAC CAG ACA GTC ACA GCA GCC TTG ACA AAA CGT TCC TGG 
Arg Pro Ala Asp Gin Thr Val Thr Ala Ala Leu Thr Lys Arg Ser Trp 
15 10 15 

AAC TCA AGC ACT TCT CCA. CAG AGG AGG ACA GAG CAG ACA GCA GAG ACC 
Asn Ser Ser Thr Ser Pro Gin Arg Arg Thr Glu Gin Thr Ala Glu Thr 
20 25 30 

ATG GAG TCT CCC TCG GCC CCT CCC CAC AGA TGG TGC ATC CCC TGG CAG 
Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys lie Pro Trp Gin 
35 40 45 

AGG CTC CTG CTC ACA GCC TCA CTT CTA ACC TTC TGG AAC CCG CCC ACC 
Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr 
50 55 60 

ACT GCC AAG CTC ACT ATT GAA TCC ACG CCG TTC AAT GTC GCA GAG GGG 
Thr Ala Lys Leu Thr lie Glu Ser Thr Pro Phe Asn Val Ala Glu Gly 
65 70 75 80 

AAG GAG GTG CTT CTA CTT GTC CAC AAT CTG CCC CAG CAT CTT TTT GGC 
Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu Phe Gly 
85 90 95 

TAC AGC TGG TAC AAA GGT GAA AGA GTG GAT GGC AAC CGT CAA ATT ATA 
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gin lie lie 
100 105 110 

GGA TAT GTA ATA GGA ACT CAA CAA GCT ACC CCA GGG CCC GCA TAC AGT 
Gly Tyr Val lie Gly Thr Gin Gin Ala Thr Pro Gly Pro Ala Tyr Ser 
115 120 125 

GGT CGA GAG ATA ATA TAC CCC AAT GCA TCC CTG CTG ATC CAG AAC ATC 
Gly Arg Glu lie lie Tyr Pro Asn Ala Ser Leu Leu lie Gin Asn lie 
130 135 140 

ATC CAG AAT GAC ACA GGA TTC TAC ACC CTA CAC GTC ATA AAG TCA GAT 
lie Gin Asn Asp Thr Gly Phe Tyr Thr Leu His Val lie Lys Ser Asp 
145 150 155 160 

CTT GTG AAT GAA GAA GCA ACT GGC CAG TTC CGG GTA TAC CCG GAG CTG 
Leu Val Asn Glu Glu Ala Thr Gly Gin Phe Arg Val Tyr Pro Glu Leu 
165 170 175 

CCC AAG CCC TCC ATC TCC AGC AAC AAC TCC AAA CCC GTG GAG GAC AAG 
Pro Lys Pro Ser lie Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys 
180 185 190 

GAT GCT GTG GCC TTC ACC TGT GAA CCT GAG ACT CAG GAC GCA ACC TAC 
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gin Asp Ala Thr Tyr 
195 200 205 

CTG TGG TGG GTA AAC AAT CAG AGC CTC CCG GTC AGT CCC AGG CTG CAG 
Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg Leu Gin 



-68- 



10 



15 



20 



tl 30 



210 215 220 



CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT GTC ACA AGA AAT 72 0 

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn 
225 230 235 2 40 

GAC ACA GCA AGC TAC AAA TGT GAA ACC CAG AAC CCA GTG AGT GCC AGG 768 
Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gin Asn Pro Val Ser Ala Arg 
245 250 255 

CGC AGT GAT TCA GTC ATC CTG AAT GTC CTC TAT GGC CCG GAT GCC CCC 
Arg Ser Asp Ser Val lie Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro 
260 265 270 

ACC ATT TCC CCT CTA AAC ACA TCT TAC AGA TCA GGG GAA AAT CTG AAC 
Thr He Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn 
275 280 285 

CTC TCC TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG TAC TCT TGG TTT 912 
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser Trp Phe 
290 295 300 



i'rf GTC AAT GGG ACT TTC CAG CAA TCC ACC CAA GAG CTC TTT ATC CCC AAC 

^ Val Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu Phe He Pro Asn 

2 25 305 310 315 320 



;*[| ATC ACT GTG AAT AAT AGT GGA TCC TAT ACG TGC CAA GCC CAT AAC TCA 

I Ile Tnr v al Asn Asn Ser Gly Ser Tyr Thr Cys Gin Ala His Asn Ser 

325 330 335 

GAC ACT GGC CTC AAT AGG ACC ACA GTC ACG ACG ATC ACA GTC TAT GCA 
Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr He Thr Val Tyr Ala 
U 340 345 350 

W 35 GAG CCA CCC AAA CCC TTC ATC ACC AGC AAC AAC TCC AAC CCC GTG GAG 
Glu Pro Pro Lys Pro Phe He Thr Ser Asn Asn Ser Asn Pro Val Glu 
Q 355 360 365 

GAT GAG GAT GCT GTA GCC TTA ACC TGT GAA CCT GAG ATT CAG AAC ACA 
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu He Gin Asn Thr 
-370 375 380 



40 



ACC TAC CTG TGG TGG GTA AAT AAT CAG AGC CTC CCG GTC AGT CCC AGG 
Thr Tyr Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arcr 
45 3 85 390 395 400 

CTG CAG CTG TCC AAT GAC AAC AGG ACC CTC ACT CTA CTC AGT GTC ACA 
Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr 
405 410 415 

AGG AAT GAT GTA GGA CCC TAT GAG TGT GGA ATC CAG AAC GAA TTA AGT 
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly He Gin Asn Glu Leu Ser 
420 425 430 

GTT GAC CAC AGC GAC CCA GTC ATC CTG AAT GTC CTC TAT GGC CCA GAC 
Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp 
435 440 445 



50 



55 



816 



864 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 
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GAC CCC ACC ATT TCC CCC TCA TAC ACC TAT TAC CGT CCA GGG GTG AAC • 1392 
Asp Pro Thr He Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn 
450 455 460 

CTC AGC CTC TCC TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG TAT TCT 
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser 
465 470 475 48 0 



10 



TGG CTG ATT GAT GGG AAC ATC CAG CAA CAC ACA CAA GAG CTC TTT ATC 
Trp Leu He Asp Gly Asn He Gin Gin His Thr Gin Glu Leu Phe lie 
485 490 495 

TCC AAC ATC ACT GAG AAG AAC AGC GGA CTC TAT ACC TGC CAG GCC AAT 
Ser Asn He Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gin Ala Asn 
15 500 505 510 

AAC TCA GCC AGT GGC CAC AGC AGG ACT ACA GTC AAG ACA ATC ACA GTC 

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr He Thr Val 

515 520 525 

20 

TCT GCG GAG CTG CCC AAG CCC TCC ATC TCC AGC AAC AAC TCC AAA CCC 

Ser Ala Glu Leu Pro Lys Pro Ser He Ser Ser Asn Asn Ser Lys Pro 

530 535 540 

"4 25 GTG GAG GAC AAG GAT GCT GTG GCC TTC ACC TGT GAA CCT GAG GCT CAG 
' " Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gin 

545 550 555 560 



35 



40 



45 



50 



55 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



i n 

in 

frf AAC ACA ACC TAC CTG TGG TGG GTA AAT GGT CAG AGC CTC CCA GTC AGT 

?2 30 Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val Ser 

' : 565 570 575 

Q CCC AGG CTG CAG CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT 

Pro Arg Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
580 585 590 

GTC ACA AGA AAT GAC GCA AGA GCC TAT GTA TGT GGA ATC CAG AAC TCA 
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly He Gin Asn Ser 
595 600 605 

GTG AGT GCA AAC CGC AGT GAC CCA GTC ACC CTG GAT GTC CTC TAT GGG 
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly 
610 615 620 

CCG GAC ACC CCC ATC ATT TCC CCC CCA GAC TCG TCT TAC CTT TCG GGA 
Pro Asp Thr Pro He He Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly 
625 "0 635 640 

GCG AAC CTC AAC CTC TCC TGC CAC TCG GCC TCT AAC CCA TCC CCG CAG 1968 
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gin 
645 650 655 

TAT TCT TGG CGT ATC AAT GGG ATA CCG CAG CAA CAC ACA CAA GTT CTC 2016 
Tyr Ser Trp Arg He Asn Gly He Pro Gin Gin His Thr Gin Val Leu 
660 665 670 



1824 



1872 



1920 
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TTT ATC GCC AAA ATC ACG CCA AAT AAT AAC GGG ACC TAT GCC TGT TTT 2064 
Phe He Ala Lys He Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe 
675 680 685 

5 GTC TCT AAC TTG GCT ACT GGC CGC AAT AAT TCC ATA GTC AAG AGC ATC 2112 
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser He Val Lys Ser He 
690 695 700 

ACA GTC TCT GCA TCT GGA'ACT TCT CCT GGT CTC TCA GCT GGG GCC ACT 2160 
10 Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr 
705 710 715 720 

GTC GGC ATC ATG ATT GGA GTG CTG GTT GGG GTT GCT CTG ATA 22 02 

Val Gly He Met He Gly Val Leu Val Gly Val Ala Leu He 
15 725 730 

T AGC AGC CCT GGTGTAGT 222 0 



20 {2} INFORMATION FOR SEQ ID NO: 17: 

. E - (i) SEQUENCE CHARACTERISTICS: 

d[ (A) LENGTH: 734 amino acids 

(B) TYPE: amino acid 
'' T 4 25 (D) TOPOLOGY: linear 

"III (ii) MOLECULE TYPE: protein 

t ? f 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



2 30 



Arg Pro Ala Asp Gin Thr Val Thr Ala Ala Leu Thr Lys Arg Ser Trp 
1.5 10 15 



Asn Ser Ser Thr Ser Pro Gin Arg Arg Thr Glu Gin Thr Ala Glu Thr 
13 35 20 25 30 



Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys He Pro Trp Gin 

35 40 45 

40 Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr 

50 55 60 

Thr Ala Lys Leu Thr He Glu Ser Thr Pro Phe Asn Val Ala Glu Gly 

65 70 75 80 



45 



Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu Phe Gly 
85 90 95 



Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gin He He 
50 100 105 110 

Gly Tyr Val He Gly Thr Gin Gin Ala Thr Pro Gly Pro Ala Tyr Ser 
115 120 125 

55 Gly Arg Glu He He Tyr Pro Asn Ala Ser Leu Leu He Gin Asn He 
130 135 140 
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lie Gin Asn Asp Thr Gly Phe Tyr Thr Leu His Val lie Lys Ser Asp 
145 150 155 160 

Leu Val Asn Glu Glu Ala Thr Gly Gin Phe Arg Val Tyr Pro Glu Leu 
165 170 175 

Pro Lys Pro Ser lie Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys 
180 185 190 

Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gin Asp Ala Thr Tyr 
195 200 205 

Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg Leu Gin 
210 215 220 

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn 
225 230 235 240 

Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gin Asn Pro Val Ser Ala Arg 
245 250 255 

Arg Ser Asp Ser Val He Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro 
260 265 270 

Thr He Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn 
275 280 285 

Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser Trp Phe 
290 295 300 

Val Asn Gly Thr Phe Gin Gin Ser Thr Gin Glu Leu Phe He Pro Asn 
305 310 315 320 

He Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gin Ala His Asn Ser 
325 330 335 

Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr He Thr Val Tyr Ala 
340 345 350 

Glu Pro Pro Lys Pro Phe He Thr Ser Asn Asn Ser Asn Pro Val Glu 
355 360 365 

Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu He Gin Asn Thr 
370 375 380 

Thr Tyr Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg 
385 390 395 400 

Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr 
405 410 415 

Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly He Gin Asn Glu Leu Ser 
420 425 430 

Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp 
435 440 445 
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Asp Pro Thr lie Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn 
450 455 460 

Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser 
465 470 475 4 8 o 

Trp Leu He Asp Gly Asn He Gin Gin His Thr Gin Glu Leu Phe He 
485 490 495 

Ser Asn He Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gin Ala Asn 
500 505 510 

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr He Thr Val 
515 520 525 

Ser Ala Glu Leu Pro Lys Pro Ser He Ser Ser Asn Asn Ser Lys Pro 
530 535 540 

Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gin 
545 550 555 560 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val Ser 
565 570 575 

Pro Arg Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
580 585 590 

Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly He Gin Asn Ser 
595 600 605 

Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly 
610 615 620 

Pro Asp Thr Pro He He Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly 
625 «0 635 640 

Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gin 
645 650 655 

Tyr Ser Trp Arg He Asn Gly He Pro Gin Gin His Thr Gin Val Leu 
660 665 670 

Phe He Ala Lys He Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe 
675 680 685 

Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser He Val Lys Ser He 
690 695 700 

Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr 

705 710 ii c 

/iU 715 720 

Val Gly He Met He Gly Val Leu Val Gly Val Ala Leu He 
725 73 0 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



5 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
0 CCAGTGAATT CCTAATACGA CTACCTATAG GTTAAAACAG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATGAACCCT CGAGACCCAT TATG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCACCAAGTA CGTAACCACA TATGG 
(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
GTGAGGACTG CTGG 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACCACTGCC CTCGAGAAGC TCACTATTG 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CACCACTGCC CTCGAGAAGC TCACTATTG 
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